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COMMUTING BIRTH-AND-DEATH PROCESSES 

By Steven N. Evans 1 , Bernd Sturmfels 2 and Caroline Uhler 3 

University of California at Berkeley 

We use methods from combinatorics and algebraic statistics to 
study analogues of birth-and-death processes that have as their state 
space a finite subset of the m-dimensional lattice and for which the m 
matrices that record the transition probabilities in each of the lattice 
directions commute pairwise. One reason such processes are of in- 
terest is that the transition matrix is straightforward to diagonalize, 
and hence it is easy to compute n step transition probabilities. The 
set of commuting birth-and-death processes decomposes as a union 
of toric varieties, with the main component being the closure of all 
processes whose nearest neighbor transition probabilities are posi- 
tive. We exhibit an explicit monomial parametrization for this main 
component, and we explore the boundary components using primary 
decomposition. 

1. Introduction. Birth-and-death processes are among the simplest Markov 
chains. In discrete time, they model a particle that wanders back and forth 
on a sub-interval of the integers by taking unit size steps. Birth-and-death 
processes arise in fields ranging from ecology, where the points in the state 
space represent population size, and at each point in time only a single birth 
or death event can occur, to queuing theory where the states represent the 
number of individuals waiting in a queue and at any time one individual ei- 
ther joins or leaves the queue. The finite time behavior of a birth-and-death 
process is easy to study because the transition matrix P, whose entry P(i,j) 
is the probability of going from state i to state j in one step, is tri-diagonal. 
This means that the matrix P can be diagonalized using a related family of 
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orthogonal polynomials, and this enables the computation of the power P n 
whose entries are the probabilities of going from one state to another in n 
steps [13, 14]. 

It is natural to consider Markov chains that have as their state space 
products of intervals in higher-dimensional integer lattices. For instance, 
the ecology model could be extended to a situation in which individuals in 
the population have a type and one keeps track of the number of individuals 
of each type, or the queuing theory example generalizes to one where there 
are several servers, each with their own set of customers, and one follows 
the respective queue lengths. 

There are also higher-dimensional models where one of the coordinates 
describes the quantity of primary interest while the others describe a fluc- 
tuating environment or background that modulates the dynamics of that 
quantity. This is the point of view taken in quasi-birth- and- death processes 
[15, 16], where the state of the primary variable is usually called the level, 
while the state of the subsidiary ones is the phase. Such models have been 
used, inter alia, to model queues in random environments and queues where 
the service times and inter-arrival times have phase-type distributions. The 
setting here is most often that of continuous time, but many of the same 
considerations apply to discrete time. A discussion of the discrete time case 
and its connection with matrix-valued orthogonal polynomials is given in 
[10]. There is also a huge literature in finance, economics and engineering 
on similar processes, where the terminology used is usually that of regime 
switching or stochastic volatility models. The setting there is, again, often 
in continuous time, and also the primary variable often has a continuous 
state space. However, numerical computations for such a model often in- 
volve approximation by one with discrete time and discrete state space. A 
few representative examples are [1, 4, 7, 11, 17, 19, 21, 22]. 

Unfortunately, even when such higher-dimensional models have only near- 
est neighbor transitions that are analogous to those of birth-and-death chains, 
it is no longer true that they are necessarily reversible, and so there is no 
hope that there will be a straightforward spectral decomposition like that 
afforded by the use of orthogonal polynomials. It is, therefore, natural to 
seek special cases where one can still recover something of the classical one- 
dimensional theory. 

One case in which this is possible is if, when we write the transition 
matrix as the sum of matrices, one for the transitions in each coordinate 
direction, the resulting collection of matrices commute. We motivate our 
study by examining the two-dimensional case in Section 2. After deriving 
the algebraic constraints for commuting matrices, we show how spectral 
methods may still be used to compute n step transition probabilities for 
a process on a two-dimensional grid. The same approach extends without 
difficulty to higher-dimensional grids. 
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With this observation in mind, we aim to understand what restrictions 
are placed on a Markov chain by the requirement that the matrices appear- 
ing in such a decomposition commute. In Section 3, we obtain an essentially 
unique parametrization of such a commuting model under the assumption 
that all nearest neighbor transition probabilities are positive; and, under the 
same assumption, we characterize the minimal number of constraints on the 
transition probabilities that are necessary to ensure commutation. We then 
use methods from linear algebra, matroid theory and algebraic statistics 
to further analyze the binomial ideal generated by the commutation condi- 
tion. In Section 4, we determine for which grid graphs the matrix describing 
the parametrization is unimodular. For these lattices, we derive an explicit 
graphical representation of the Graver basis, which is the most inclusive in 
the hierarchy of lattice bases in [5], Section 1.3. In Section 5, we explore the 
extraneous boundary components of our commuting variety. These corre- 
spond to families of commuting birth-and-death processes having some zero 
transition probabilities which are not limits of commuting birth-and-death 
processes with positive transition probabilities. This underlines the applica- 
bility of binomial primary decomposition in probability and statistics, well 
beyond the contexts envisioned by Diaconis, Eisenbud and Sturmfels in [3]. 

2. Motivation: Birth-and-death processes in dimension two. Consider a 
discrete-time, time-homogeneous Markov chain Z = (Zi-)?L that has as its 
state space the finite two-dimensional grid E := {0, 1, . . . , m} x {0, 1, . . . , n}. 
Suppose the chain makes jumps of size one either upwards, downwards, to 
the right or to the left. In other words, if we impose a graph structure on 
E by declaring that two states and are connected by an edge 

if and only if \i' — i"\ + \f — j"\ =1, then the chain can only make "nearest 
neighbor" jumps. If we draw E in the plane and include the edges as intervals 
of unit length, then the resulting figure is made up of to x n squares, and so 
we refer to E as the to x n grid. 

The dynamics of Z are specified by the transition probabilities 
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These transition probabilities are nonnegative real numbers that satisfy 
Lij + Rij + Dij + Uij < 1. This inequality is allowed to be strict. As usual, 
strict inequality is interpreted in terms of an adjoined absorbing state t with 
F{Z k+1 = \\Z k = = 1 - (Lij + Rij + Dij + Uij). 

The transition matrix P of the Markov chain Z has format \{m + l)(n + 
1)] x [(to + l)(rt + 1)]. It may be written as a sum of two matrices, P = 
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Ph + P V i one for the horizontal moves and one for the vertical moves. The 
horizontal matrix Ph commutes with the vertical matrix P v if and only if 
the following four constraints hold for all index pairs € {0, 1, . . . ,m — 
1} x{0,l,...,n-l}: 



(2.1) 



Di,j+lRi,j 
Di+lJ+lLi+lj 



Ri,j+lP > i+l,j+l 
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i+X,j+lDi,j+l 
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(up-right); 
(down-right); 

(down- left); 
(up-left). 



In English, for each corner of a square in the grid, the probability of going 
from that corner to the diagonally opposite corner of the square in two steps 
is the same for the two possible paths. 

Example 2.1 (m = 2 and n = 1). We label the vertices of the 2x1 grid 
as follows. 
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The 6x6 transition matrix P = Ph + P v is the sum of the horizontal 
transition matrix 
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and the vertical transition matrix 
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%3 , Dij , Uij are all strictly 



Suppose that the 14 transition probabilities Lij,R. 
positive. Then the commuting relations (2.1) are found to be equivalent to 
the following rank constraints: 
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Indeed, the eight equations in (2.1) are among the twelve 2 x 2-minors of 
these two 2 x 4-matrices. The constraints (2.2) imply that our commuting 
variety {PhPv = PvPh} nas the parametrization 
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An analogous parametrization for commuting birth-and-death processes on 
larger two-dimensional grids and in higher dimensions will be derived in 
Section 3 and further studied in Section 4. The analogues of the parameters 
hi, hi, vi and the parameters straightforward to compute given the 

transition matrix P. We note that if the transition probabilities are allowed 
to be zero then (2.3) is insufficient because (2.1) can be satisfied even if (2.2) 
fails. The identification of such boundary phenomena is the content of our 
algebraic discussion in Section 5. 

The parametrization (2.3) of the transition matrix P in Example 2.1 can 
be rewritten as 

(2.4) P h = AQ h A~ 1 and P V = AQ V A~ 1 , 

where the three matrices appearing on the right-hand side are 
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As we shall see in Theorem 3.1, the matrix parametrization (2.4) is valid 
for the m x ra-grid. Here A is a diagonal matrix of size [m + l)(n + 1), 
the matrix Q v is block diagonal with m + 1 identical (n + 1) x (n + 1)- 
blocks R v that are symmetric and tri-diagonal and Qh is block diagonal 
with n + 1 identical (m + 1) x (m + 1) -blocks that are symmetric and 
tri-diagonal. Note that in order to make the block diagonal structure of the 
matrix Qh apparent, it is necessary to re-order the rows and columns as 
follows: (0,0), (1,0), (2,0), (0,1), (1,1), (2,1). Thus the tri-diagonal matrices 
Rh and R v satisfy 

and 

Qv{(i',j'W,j")) 

By the spectral theorem for real symmetric matrices [12], Theorem 4.1.5, 
the rth power of the matrix Rh has entries 

m 

Ri(i',i'')=j2 x h,ku h , k (^hMn, 

fc=0 

where A^o. ^ft,i> • • • , ^h,m are the eigenvalues of Rh and the u^k are the 
corresponding orthonormalized eigenvectors. With analogous notation, for 
any positive integer r, we have 

n 

R r v (i',i") = Y,K^vAj>vAj")- 



R h (i',i"), iif=j", 
0, otherwise, 



R v (j',j"), if i'=i", 
0, otherwise. 
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Now, because Ph and P v commute, the i-step transition probability matrix 
P is given by 



V 



s=0 ^ ' 

= Y^( l \{AQ h A^Y{AQ v A- x ) 

s=0 ^ ' 

„_n V / 



By combining the above formulas, we obtain the following simple expression 
for the t-step transition probabilities of the Markov chain Z: 



^ ^2(>>h,k + K,e) t u h ,k('t)uh,k('t / )u v /(j')u Vt e(j") 
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Moreover, note that for a constant c > and / the identity matrix, 



a i',f 



k I 



a i"j>n 



so we may readily compute the t-step transition probabilities of certain 
chains for which the particle has a constant positive probability of not mov- 
ing from its current position at each step. The calculations we have just done 
extend to the finite higher-dimensional grids E to be discussed in the next 
section. The problem of finding the t-step transition probabilities for these 
models reduces to finding the eigenvalues and eigenvectors of a collection of 
symmetric tri-diagonal matrices whose nonzero entries are the parameters 
W(u,v) in Theorem 3.1. 

In our definition of the Markov chain Z above, we did not require that 
the rows of the transition matrix P add to 1. In order to get a feeling for 
the effect of imposing that extra constraint, we return to the case m = 2 
and re = 1 in Example 2.1. The rows of P add to 1 if and only if 1 is an 
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eigenvalue of the symmetric matrix 



(0,0) (0,1) (1,0) (1,1) (2,0) (2,1) 



Qh + Q 



(0,0) 
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vi hi 

v 1 hi 

hi vi h 2 

hi vi /i 2 
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with corresponding right, and hence also left, eigenvector (a 00 , a 01 , a 10 , a 



The preceding matrix is irreducible when the parameters hi,h 2 , v\ are 
positive, and the Perron-Frobenius theorem [12], Theorem 8.4.4, guarantees 
that the eigenvalue with largest absolute value is positive, with an eigenvec- 
tor that has positive entries and is unique up to a constant multiple. Con- 
sequently, if positive parameters hi,h 2 ,Vi are given, then replacing them by 
bhi,bh 2 ,bvi for a suitable positive constant b permits the choice of parame- 
ters a>ij that are unique up to an irrelevant constant multiple such that the 
rows of P sum to 1 . More generally, replacing hi,h 2 , v\ by bhi , bh 2 , bvi for 
a suitable positive constant b permits the choice of parameters a,j that are 
unique up to an irrelevant constant multiple such that the rows of P sum 
to 1 — c for a given constant < c < 1, in which case the rows of the ma- 
trix cl + P sum to 1 . These considerations extend to larger two-dimensional 
grids and to higher dimensions. 

3. Parametrization. Suppose now that we have a discrete-time, time- 
homogeneous Markov chain with state space 



The m-dimensional grid E indexes the rows and columns of the transition 
matrix P. We assume that P(u, v) = unless u ~ v by which we mean that 
u — v G {±efc} for some k £ {1, . . . , m}. Here is the standard basis vector 
with entry 1 in the kth coordinate and elsewhere. Thus P describes a 
Markov chain with nearest neighbor transitions on the graph with vertices E 
where two vertices u and v are connected by an edge if u ~ v. For 1 < k < m, 
define a matrix P^ by 



These matrices are the analogues of the matrices P^ and P v for the two- 
dimensional grid. The requirement that the matrices Pj and Pj commute 



20 ' °21 I 



£:={0, l,...,ni} x {0, l,...,n 2 } x ••• x {0, 1, . . . ,n m }. 




if u — v G {±efc} 
otherwise. 
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for all pairs of indices 1 < i < j < m is equivalent to the condition that the 
following quadratic expressions vanish for all i,j and it: 

P(u,u + ei)P(u + ei,u + ei + ej) 

— P(u,u + ej)P(u + ej, it + e% + ej), 
P(u,u + ei)P{u + ei,u + ei - ej) 

— P(u, it — ej)P{u — ej,u + ei — ej), 

(3.1) 

P(u,u- ei)P{u - ei,u- ei + ej) 

— P(u, it + ej)P{u + ej,u — ei + ej), 
P(u, it — ei)P(u — ei,u — ei — ej) 

— P(u, it — ej)P{u — ej,u — ei — ej). 

Our aim is to solve this system of polynomial equations for the unknowns 
P(u,v). The next theorem offers such a solution under the assumption that 
the unknowns are strictly positive. 

Theorem 3.1. Suppose that P(u, v) > for all u,v S E such that u ~ v. 
Then the matrices P±,P2, ■ ■ ■ ,P m commute pairwise if and only if 

(3.2) P(u,v) = a u ■ W(u,v) ■ a" 1 , u,v£E, 

for some collection of constants a u andW(u,v), u,v G E, that satisfy W(u',v') = 
W(u" , v") when v' — u' £ {ie^} for some k and {u" , v"} = {u' + w,v'+ w} 
for some w S X^fc^ e ^- The constants a u are unique up to a common mul- 
tiple, and the constants W(u,v) are unique. 

Example 3.2 (2x1 grid). If m = 2, n\ = 2 and n 2 = 1, then this is 
the parametrization (2.3) in Example 2.1 with hi = W((0,0), (1,0)), /12 = 
W((1,0), (2, 0)) and Vl = W((0,0), (0, 1)). 

Proof of Theorem 3.1. The sufficiency of the stated condition is 
straightforward. For example, 

P(u,u + ei)P(u + ei,u + ei + ej) 

= a u W{u,u + ei)a-l ei a u+ei W(u + ei,u + ei + ej)a-\ eiJre] 

= a u W(u,u + ej)a~l e] a u+ej W{u + ej,u + e t + ej)a~l e . +e . 

= P(u,u + ej)P(u + ej,u + ei + ej), 

because W(u, u + ei) = W(u + ej,u + ei + ej) and W(u, u + ej) = W(u + 
ei,u + ei + ej). 
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For the converse, we first show that if the matrices Pi, ... , P m commute, 
then the transition matrix P is reversible; that is, there are positive constants 

b u such that b u P(u, v) = b v P(v, u) for all u,v G E. Write =U\-\ h u m 

for u = (tii, • • • ,Um) £ E. We claim it is possible to construct the constants 
b u , u G E, by induction on the value of ||u||, and that their values are 
determined once bo is specified. For u G E with ||n|| = 1, we must take 
b u = boP(0,u)/ P(u,0). For v G E with ||u|| = 2, we need to be able to find b v 
such that b u P(u,v) = b v P(v,u) for all tt G E with ||tt|| < 1. This is equivalent 
to showing that we can choose b v so that b u P(u, v) = b v P(v, u) for all u G E 
of the form u = v — et for some k. If v = 2e& for some k, then we must take 

P(e fc ,2e 2fc ) P(0,e fc )P(e fc ,2e 2fc ) 
" efc P(2e fc ,e fc ) V(e fc ,0) P(2e fc ,e fc ) ' 

If v = ej + e,- for i ^ j, then it is necessary that 



b, 



P(ei,ei + ej) P(0, e») P(ej, + ej] 



and 



P(ei + ej, ej) P(ej, 0) P(ej + e,-, ej) 
P(ej,ej + ej) _ P(0,e j ) P(ej,ej + 



P(ei + ej,ej) " u P(ej,0) P(e, +ej-,ej) 

are equal, in which case we must take 6^ to be the common value. However, 
from (3.1) we have 

P(0, ej)P( ei , ej + ej ) = P(0, e^P^^i + e-,) 

and 

P(ei + ei ,ej)P(ej,0) = P(ej + e i ,e i )P(e i ,0). 

Hence the value of b v is uniquely defined. Continuing in this way shows that 
if w G E and = wq, w±, . . . , wn = w with wm+i — wm G {ei, . . . , e m } for 
< M < N- 1, then the value of 

1 P(u>Af,tUM+l) 



n 



does not depend on the choice of w±, . . . ,wn-i- Moreover, if we take b w to 
be this common value, then the collection of constants b u , u G E, is such 
that b u P(u,v) = b v P(v,u) for all u,v G E, and this is the unique collection 
with that property and the given value of &o- 

Now, suppose that we have positive constants b u , n G E, such that b u P(u, v) 

— 1/2 

b v P(v,u) for all u,v G E. If we set a u = b u and define W(u, v) by 
(3.3) P(u,v) = a u W{u,v)a-\ 
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then 

(3.4) W(u,v) = W(v,u). 

Conversely, if we have constants a u and W(u, v) that satisfy (3.3) and (3.4), 
then 

P(u,v) _ g n W(-u,t;)a~ 1 _ 2 -2 

P(v,u) ~ a v W(v,u)aZ 1 ~ ^ ' 

when u ~ v, and so a~ 2 P(u,v) = a~ 2 P(v,u), u,v £ E. From what we have 
argued above, this implies that there are constants, a u , u£ E and W(u, v), 
u,v G E, that satisfy (3.3) and (3.4); the a u are unique up to a common 
constant multiple, and the W(u,v) are unique. 

To complete the proof, we need to check that the W(u,v) have the ad- 
ditional properties listed in the statement of the theorem. Because P(u,v), 
u, v £ E is a common zero of the polynomials in (3.1), it follows that W(u, v), 
u, v 6 E is a common zero of the same set of polynomials. The constraints 
that are associated with a particular two-dimensional face of one of the 
hypercubes in the grid with vertices {u, u + e^u + ej,u + e$ + ej} are 

W(u,u + ei)W(u + ei,u + ei + ej) 

= W(u,u + ej)VF(u + ej,ii + ej + ej), 

VF(u + ei,u + ej + ej)W(u + ej + ej,u + ej) 
= W(u + ej,n)M / (n,u + ej), 

W(u + ei + ej,u + ej)Ty(u + e^it) 

= VF(u + ei + ej , it + ei)W(u + e i5 w), 

W^(u + ej, u)M^(u, u + ej) 

= W(u + ej,u + ei + ej)W(u + ej + ej,u + ej). 

Because W(w, = W(v, w) for all u,v £ E, these four constraints are equiv- 
alent to 

W(u, u + ei)W(u + ei,u + ei + ej) = W(u, u + ej)W(u + ej,u + ei + ej) 
and 

W(u + ej, u + ei + ej)VF(u + ej, it + ej + ej) = W(u, u + ej)W(u, u + ej) 
and hence to the equations 

W(u,u + et) = W(u + ej,u + ei + ej) 

and 

V7(m,u + ej) = W(u + ei,u + ej + ej). 

Iterating these two equations yields the remaining assertions stated in The- 
orem 3.1. □ 
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Remark 3.3. Note that if u — v G {±e/c}, then {u,v} = {he k + r,(h + 
+ r} for unique values of h € {0, 1, . . . , — 1} and r £ Xlfe^fe ^ e £- There- 
fore, if the necessary and sufficient conditions of Theorem 3.1 hold, then 
VF(ii, v) = W(hek, (h + l)efc). Thus the parametrization involves n\ + ri2 + 
• • - + ™m uniquely defined parameters of this form and {n\ + l)(n2 + 1) • • • (n m + 
1) parameters of the form a u . The latter are uniquely defined up to a com- 
mon multiple. 

The set of constraints (3.1) has some redundancies when the unknown 
quantities P(u,v), u ~ v, are positive. For example, any three of the equa- 
tions in (2.1) implies the fourth. In what follows, we present a linear al- 
gebra approach to identifying and representing constraints that are inde- 
pendent. Note first that the quadratic equations in (3.1) are all of the 
form P(a,b)P(b,c) — P(a,d)P(d,c) =0. This condition is equivalent to 
[P(a, b)P(b, c)]/[P(a, d)P(d, c)] = 1 or Q(a, b) + Q(b, c) - Q(a, d) - Q(d, c) = 
where we write Q(u,v) for the logarithm of P(u,v). This suggests the fol- 
lowing encoding of (3.1) as rows of a matrix with entries from { — 1,0, +1}. 



Notation 3.4. Let 5' ni '" ,,nm ' be the matrix that has one row for each 
polynomial in (3.1) and columns indexed by ordered pairs (u,v) £ E x 
E with u ~ v, with the row corresponding to a polynomial of the form 
P(a,b)P(b,c) — P(a,d)P(d,c) having entries +1, +1, — 1, — 1 in the columns 
associated with the pairs (a, b), (b, c), (a, d), (d, c) and zero entries elsewhere. 

Remark 3.5. The matrix sOhi-."™) has format 




We wish to determine the rank of S^ ni >"'' Um \ that is, the dimension of the 
vector space spanned by the rows, as this gives the size of a maximal set of 
independent constraints for positive P(u,v). 
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Example 3.6 (2x1 grid constraint matrix). The 8 x 14-matrix S^ 2 ' 1 ) 
has rank 6 and equals 





Roo 


R01 


Rio 


R11 


Ao 


Ai 


Ao 


Ai 


AioAo 


— Ao All 


1 


-1 




















AoAi 


- Ao Ao 














-1 


1 








AiAi 


— A1A0 














-1 


1 








An Roo 


— An Ai 


1 


-1 




















Rio Ao 


— Ao Ai 








1 


-1 














AoAi 


- L20U10 




















-1 


1 


L21D11 


— D21L20 




















-1 


1 


Ai Ao 


— R11D21 








1 


-1 
















Uoo 


u w 


u 20 


An 


Ai 


An 






A0A0 


— Ao Al 


-1 


1 


















A0A1 


- Ao Ao 


-1 


1 


















A1A1 


— A1A0 











1 


-1 









AlAlO 


— R01D11 











1 


-1 









R10U20 


- UiqRu 





-1 


1 















A0A1 


- L20U10 





-1 


1 















L21D11 


— D21L20 














1 


-1 






Ai Ao 


— R11D21 














1 


-1 







This matrix encoding of the constraints suggests a similar encoding for 
the parametrization. 

Notation 3.7. Let ,4( n i>-.«m) a ma t r i x that has one column for each 
ordered pair (u,v) £ E x E with u ~ v and one row for each of the parameters 
identified in Remark 3.3; the column corresponding to the pair (u,v) has +1 
in the coordinate corresponding to the parameter a u , —1 in the coordinate 
corresponding to the parameter a v and +1 in the coordinate corresponding 
to the parameter W(u,v). Such a column records the parametrization of 
P(u,v) in (3.2). 



Remark 3.8. The matrix J[i n u-,nm) ^ as format 



in 



Y[(rii + 1) + J2 n i) x |2^niJJ(n i + l) 

U=l i=l / \ i=l j^i / 
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Example 3.9 (2x1 grid parametrization matrix). The 9 x 14-matrix 



Poo Roi Rio Rn Pio L\\ L20 L21 Uqo Uiq U20 An D\\ D21 



a 00 


1 











-1 











1 








-1 








«01 





1 











-1 








-1 








1 








«10 


-1 





1 





1 





-1 








1 








-1 





«11 





-1 





1 





1 





-1 





-1 








1 





«20 








-1 











1 











1 








-1 


«21 











-1 











1 








-1 








1 


hi 


1 


1 








1 


1 


























h 2 








1 


1 








1 


1 




















Vl 


























1 


1 


1 


1 


1 


1 



The following result is essentially a restatement of Theorem 3.1. It will 
enable us to compute the maximal number of linearly independent rows of 
<j(m,...,n m ) £ rom ^ rank of if ni, -'" m '. 

Corollary 3.10. The two vector spaces spanned by the rows of the 
matrix S^ niv "' nm ^ and the rows of the matrix i'" 1 ''"'' 1 '"' are the orthogonal 
complements of each other. 

Proof. First note that each row vector of A is orthogonal to the row 
space of S. This follows directly from the parametrization given in Theorem 
3.1, so it suffices to prove that any vector w S ker(5) is in the rowspan of 
A- Let k x I denote the size of the matrix A. Take w G ker(5) and consider 
g = (e- Wi )i=i ...j, the componentwise exponentiation of w. We will construct a 
transition matrix P from w. The transition matrix P has the nonzero entries 
P(u,v) u ~ v equal to the corresponding entries of g. Since w S ker(5), the 
matrix P can be decomposed into a sum of transition matrices Pi, ... , P m 
(one in each coordinate direction) which commute pairwise. So by Theorem 
3.1, we have g = (u ai , . . . , u ai ) where u is the vector of vertex and edge weight 
parameters, a±,...,ai, are the columns of A, and we take componentwise 
exponentiation. We conclude that w = log(g) = (log(ti), . . . ,log(tk))A is a 
linear combination of the rows of A. □ 

Lemma 3.11. The rank of the matrix A^ ni '""' nm ^ is one less than the 
number of rows, that is, 



_4.( 2,1 ) has rank 8 and equals 



rn 



111 



(3.5) 




i=i 



i=l 
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Proof. By [12], Remark 0.4.6(d), it suffices to show that the Gram 
matrix ^ nir "' nm ' ^{ni,—,n m ) h as ran k equal to the number in (3.5). Suppose 
that x is a row of „4.( ni >---, n m) corresponding to the parameter a u for a vertex 
u of the grid E. The inner product x ■ x is twice the vertex degree, that is, 
the number of v G E with u ~ v — because a u appears in the two transition 
probabilities P(u,v) and P(v,u) for each such v. If x' and x" are two such 
rows, then the inner product x' ■ x" is if u 96 v and it is — 2 if u ~ v. 

Now suppose that y is a row of ^y 11 '"' ' nm ' corresponding to one of the 
edge parameters W(he k , (h + l)e^). The inner product y-y is 2f|^ fc (n£ + 1) 
because the edge parameter appears in the transition probabilities P(hek + 
r, (/i + l)e/c + r) and + l)efc + r, /le^ + r) for n^fc( n ^ + 1) choices of 
r £ Y2e^k^ e t- ^ y' anc ^ are ^ wo such rows, then y' ■ y" = 0, since each 
transition probability only involves a single edge parameter. For the same 
reason, y-z = if z is a row of ( 4.( n i>—> n »») corresponding to an edge parameter 
WO'e/, 0' + with^/c. 

Lastly, if x and y are as above, then x ■ y is clearly unless the vertex ii 
associated with x is in the set {he^ + r : r G X^fc ^ e ^} U {(/i + l)e& + r : r G 
X^fc2^} of starting and ending points of transitions that involve the edge 
parameter associated with y. If the vertex u is in this set, then x ■ y = (+1 x 
+ 1) + (+1 x — 1) = also, with the +1 coming from a transition probability 
of the form P(u,v), and the —1 coming from a transition probability of the 
form P(v,u). 

Let X and 3^i,---,3^m be the submatrices of J\y n ^—,nm) that correspond 
to the rows for the vertices of the grid graph E, and the parallel edges in 
direction ei, . . . , e m , respectively. We have shown that y%yj = ^Wj^i{ n j + 
1)/ where / is the identity matrix of appropriate size, and XX T = 2(A - 
J) where A is the diagonal matrix listing the vertex degrees and J is the 
adjacency matrix of the graph. Thus, XX T is a multiple of the Laplacian 
matrix of the graph, and because the graph is connected, its rank is one 
less than the number of vertices [9], Lemma 13.1.1. The matrices Xy? and 
yiyj are all when i^j. We conclude that J^ n ^^ n m) J^ii,...,n m ) j s a 
block diagonal matrix, with a block of rank n^=i( n « + -0 ~~ 1> one ^ ess t nan 
its size, and m blocks of full rank ni, . . . ,n m , respectively. Therefore, the 
rank of J\i n ^r--^ m ) j^nx,...,^) j g p rec i se ly the quantity in (3.5). □ 

Corollary 3.10 and Lemma 3.11 imply the following result. 
Proposition 3.12. The rank of the constraint matrix 5( n i.---,«m) eaua \ s 

m / \ m m 

(3.6) 2 ( rn Y[(nj + 1) ) - Y[(ni + 1) - J^ n i + L 

i=l ^ j^i ' i=l i=l 
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Remark 3.13. The number (3.6) has the following geometric interpre- 
tation. We regard E as the edge graph of a cubical cell complex. The ^-cells 
of that complex have the form 

{u + \ie kl + X 2 e k2 H h X e e ke :0 < A x , A 2 , • • • , X e < 1} 

for u G E and basis vectors e^, . . . ,e k( such that u + Sj=i e fcj e ^ can 
be shown that 

m 

(3.6) = ]T(-l)^+l)#£-cells, 

£=2 

but we omit the proof. The reasoning behind this alternating formula, which 

is reminiscent of an Euler characteristic, can be used to select a row basis 
of l 5(ni,...,n ro )_ 

4. The toric ideal. In this section we examine the matrix A = AS ni '"' ,nm ' 
from the perspective of combinatorial commutative algebra [6, 20]. Let R[P] 
be the polynomial ring over the real numbers M generated by the unknowns 
P(u,v) with u,v £ E and u~v. We write I a for the toric ideal associated 
with the matrix A. Thus I a is generated by all binomials P z+ — P z ~ where 
we take componentwise exponentiation, and Z = Z + — Z_ runs over all in- 
teger vectors in the kernel of A. Among these binomials are the quadratic 
binomials in (3.1) whose corresponding vectors Z are the rows of the ma- 
trix S = ,s( n i>— >"•"•>). These quadratic binomials do not suffice to generate 
the toric ideal I a, and one of our objectives is to identify a generating set 
of binomials. In algebraic statistics [5], Section 1.3, one considers several 
generating sets of I a- Minimal generating sets are known as Markov bases. 
However, it is often more natural to study the larger Graver basis, which also 
contains all circuits of the integer kernel of A, and all reduced Grobner bases 
of I a . Recall that the circuits of a sublattice of Z N are the primitive vectors 
of inclusion- minimal support. Here "primitive" means that the coordinates 
have greatest common denominator equal to one, so there are finitely many 
circuits, and they are unique up to sign. For an elementary introduction to 
toric ideals we refer to [20] , and for their interpretation in terms of Markov 
chains, see [3, 5]. 

Example 4.1 (2x1 grid). If m = 2, m =2 and n 2 = 1, then R[P] is a 
polynomial ring in 14 unknowns and I4 is the ideal of 2 x 2-minors of the 
two matrices in (2.2). Thus the Markov basis of A has 12 elements. The 
Graver basis of A has 29 elements, listed in Example 4.12 below. 

The space of commuting birth-and-death processes with positive proba- 
bilities is the set of positive real points on a toric variety. The ideal repre- 
senting that toric variety is the toric ideal The toric variety V{IX) is 



COMMUTING BIRTH-AND-DEATH PROCESSES 



17 



D-dimensional in R where N = 2 Y^JiLi n i Y\j^i(. n j + -0 an d D = ni^=i( n « + 
1) + ^2i=± n i ~ 1- These numbers were derived in Section 3. Moreover, a 
parametrization of the toric variety of commuting birth-and-death processes 
was given in (3.2). 

We start our study in this section by reviewing two definitions regarding 
matrices in general. 



Definition 4.2. A Cayley matrix is a special block matrix of the form 



1 ... 1 










1 ... 1 






* 




* 






















1 ... 1 



By changing the order of the columns of our matrix A, we can see that A 
is a Cayley matrix where the blocks correspond to the parallel edges in the 
lattice. Toric varieties associated with Cayley matrices have a nice, special 
structure, and they appear in applied contexts such as chemical reaction 
networks [2], Theorem 9. Here is an even more important special structure. 

Definition 4.3. An integer matrix of rank d is unimodular if all its 
nonzero d x d minors have the same absolute value. There are many equiv- 
alent characterizations. For instance, a matrix is unimodular if and only if 
all initial monomial ideals of its toric ideal are squarcfrcc [20] , Section 10. 

Investigating whether a matrix is unimodular is interesting from the per- 
spective of algebraic statistics and computational algebra. One reason is the 
following result which is proven in [20]. 

Proposition 4.4. Let A be any unimodular matrix. Then every reduced 
Grobner bases of the toric ideal I a consists of differences of squarefree mono- 
mials. Moreover, the following three sets coincide: the union of all reduced 
Grobner bases, the set of circuits and the Graver basis of A. 

It is thus natural to ask whether our matrix is unimodular. We address 
this question as follows: 

Theorem 4.5. The Cayley matrix A = i' ni '"''" m ' is unimodular if and 
only if the dimension of the grid equals m = 2 and the format of the grid is 
either 2x2 or n x 1 for some n > 1. 
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Our proof of this theorem employs tools from matroid theory. Recall that a 
matroid M is a pair (£,X) consisting of a finite ground set £ and a collection 
X of subsets of £ that satisfy: 

(i) 61. 

(ii) If h el and I 2 C I x , then I 2 G X. 

(hi) If I\, I 2 G X and #/i < #^2; then there is an element e£ I 2 \h such 
that h U {e} G X. 

The members of X are the independent sets of M, and subsets of £ that are 
not in X are dependent. A maximal independent set in M is a frasis of M, 
and a minimal dependent set a circuit of M. A particular class of matroids 
arises from matrices as follows. The ground set £ consists of the columns 
of a matrix A, and X is the set of linearly independent subsets of column 
vectors of A. 

The following three results from matroid theory will be useful for the proof 
of Theorem 4.5. These and further properties of matroids can be found in 
[18]. 

Lemma 4.6. Let £ be a set and C a collection of subsets of £. Then C 
is the collection of circuits of a matroid on £ if and only if C satisfies the 
following three conditions: 

(i) 0?C. 

(ii) // Ci,C 2 G C and d C C 2 , then C X = C 2 . 

(hi) If Ci,C 2 G C distinct and e G C\ n C 2 , then BC-j G C such that C3 C 
(CiUC 2 )\{e}. 

Lemma 4.7. The collection B of bases of a matroid M satisfies the fol- 
lowing two conditions: 

(i) B is nonempty. 

(ii) // Bi,B 2 G B and e G B 1 \ B 2 , then 3f G B 2 \ B x such that (B 1 \ 
{e})U{f}GB. 

Lemma 4.8. Let M be a matroid over a set £ and B a basis of M . 
If e G £ \ B, then B U {e} contains a unique circuit C(e,B). Moreover, 
e£C(e,B). 

Our goal is to describe the circuits of the matroid associated with the 
matrix A = A ( ni >— > n "»). The ground set £ is the set of column vectors of A. 
We identify £ with the set of directed edges in the grid E. Two directed edges 
in £ are called parallel if they point in the same direction, and they have 
the same edge parameter W(u,v); that is, directed edges (u',v') and (u",v") 
are parallel if v' — u' G {±e/%} for some k, and (u" , v") = (v! + w, v' + w) for 
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some w £ X^fc^ e ^- Recall that a cycle in a graph that is not necessarily 
simple is a subgraph in which every vertex has degree 2. 

Definition 4.9. Let C = c( n i>->™m) be the set of subsets C C £ with 
the following properties: 

(i) C is a disjoint union of pairs of distinct parallel directed edges. 

(ii) The set of undirected edges corresponding to C, where two directed 
edges (u,v) and (v,u) are replaced by two undirected edges, is a union of 
edge disjoint cycles. 

(hi) If B C C, then at least one of (i) or (ii) does not hold for B. 

The set C^ 2,1 ^ of such walks on the 2 x 1-grid has 29 elements, listed in 
Example 4.12 below. 

Proposition 4.10. The set C is the set of circuits of a matroid on 8. 

Proof. The collection C satisfies the three conditions in Lemma 4.6: 
Condition (i) is trivial, (ii) comes from the minimality requirement (iii) in 
the definition of C, and (iii) holds because by eliminating the shared directed 
edges in two overlapping elements, C\, C2 £ C, we get a collection of directed 
edges, which satisfies (i) and (ii) of Definition 4.9 and hence contains an 
element of C. Thus C is the set of circuits of a matroid on 8. □ 

In an early stage of our project, we believed that Proposition 4.10 charac- 
terizes the matroid of A. Later we found out that this is true over a field of 
characteristic two but generally false over the real field R we are interested 
in here. However, when A is unimodular the matroid is independent of the 
characteristic of the ground field. Here is one instance where this happens. 

Lemma 4.11. For a grid of size nxl, the setC equals the set of circuits 
of the matroid of A. 

Proof. For nxl grids, the condition (iii) in Definition 4.9 implies that 
the set of undirected edges corresponding to a circuit C G C is a union of, at 
most, two cycles. For each C E C we construct a vector Vc in {0, +1, — 1}^ 
whose support equals C and that lies in the kernel of A. The vector Vc 
is constructed as follows: we choose a cycle in C and walk along this cycle 
clockwise. We assign +1 to forward-pointing edges and —1 to backward- 
pointing edges. If C consists of two cycles, we walk along the second cy- 
cle counterclockwise and assign +1 to forward-pointing edges and —1 to 
backward-pointing edges. This ensures that Vc has zero inner product with 
each row of A that is indexed by a vertex parameter a v . Parallel pairs of 
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edges receive opposite signs because they point in the same direction. This 
ensures that Vc has zero inner product with each row of A that is indexed 
by a parameter W(u,v). Hence Vc lies in the kernel of A. 

The construction of Vc reveals that the columns of A corresponding to 
any proper subset of C are linearly independent. This implies the inclusion 

C C {minimal linearly dependent subsets of column vectors of A}. 

But the reverse inclusion D holds as well. Consider any V S ker(.A) \ {0}. 
We interpret V as a multiset of signed directed edges in £ where each entry 
denotes the number of the corresponding directed edges. These edges come 
in parallel pairs of opposite sign because V has zero inner product with the 
rows of A that are indexed by parameters W(u,v). The undirected graph 
underlying V is a union of cycles because V has zero inner product with the 
other rows. Hence the support of V must contain some circuit C £ C. This 
completes the proof of Lemma 4.11. □ 

Proof of Theorem 4.5. To prove the "only if" direction, it suffices 
to give two examples of circuits which are not squarefree, one for the 3x2 
grid and one for the lxlxl grid. These examples are also circuits for 
larger two-dimensional grids, and for grids of dimension m > 3, respectively. 
Then, by Proposition 4.4, we conclude that the corresponding matrix A is 
not unimodular. 

(i) 3x2 grid: the following nonsquarefree binomial is a circuit: 

£10^30^01 ^2oA)2-D3i — £12-^32^10^11^31 A)l-D22- 

(ii) lxlxl grid: the nonsquarefree binomial 

-^000-^010-Fioo-Booi — -Rooi-Roii-Diio-Fbio-Siii 

is a circuit where B and F denote the additional backward and forward 
moves. 

We now prove the "if" direction for the n x 1 grid. Let k denote the 
number of rows of the matrix A = A^ 1 ' 1 ' ■ Then rk(.A) = k — 1 by Lemma 
3.11. We must show that all nonzero (k — 1) x (k — 1) minors of A have 
the same absolute value. The row vectors of A corresponding to the vertices 
of the grid are not independent; they sum to 0. So in order to get nonzero 
minors, we delete the row corresponding to the vertex ao,...,o- The resulting 
matrix is denoted A '. 

In Lemma 4.11 we characterized the circuits of the matroid on the column 
vectors of A' for n x 1 grids. Consider two distinct bases B and B' of this 
matroid. To finish the proof, we need to show that the determinants of the 
submatrices of A' corresponding to these bases have the same absolute value. 
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Let e £ B\B' . So by Lemma 4.7, 3f£B'\B such that B 1 = (B\ {e}) U {/} 
is a basis. This construction can be carried on until B^ = B' ' . So without 
loss of generality, B\ = B' . By Lemma 4.8, BU {/} contains a unique circuit 
C and / £ C. Because i?i is also a basis, we get by the same argument 
that e € C. The corresponding vector Vc in kernel(.A) has coordinates ±1 in 
positions e and /. Hence we can replace the column vector e by the column 
vector / without changing the absolute value of the determinant. In symbols, 
|det(J3)| = |det(J5')|. 

To finish the proof, we need to show that the 13 x 24 matrix A for the 
2x2 grid is unimodular. This can be verified with the help of a computer 
algebra system such as CoCoA or 4ti2. Namely, we compute the Graver bais 
of A, and we observe that all binomials in that Graver basis are squarefree. 
Since the Graver basis contains every reduced Grobner basis, this implies 
that every initial monomial of I a is squarefree. Hence, by [20], Remark 8.10, 
the matrix .A' 2,2 ) is unimodular. □ 

We conducted a further computational study of the toric ideal lj± for the 
two minimal grids whose matrices A are not unimodular, namely the 3x2 
grid and the lxlxl grid. In what follows, we present two tables that show 
the total number of binomials of each degree for the Graver basis, the set 
of circuits and the minimal Markov basis. In parenthesis are the numbers 
of squarefree binomials in these bases. The two tables were computed using 
the software 4ti2. 

3x2 grid: 









Degree 






2 3 


4 


5 6 7 


8 


Graver basis 45 (45) 128 (128) 464 (464) 1600 (1600) 3904 (3904) 4928 (4032) 1088 (192) 
Circuits 45 (45) 128 (128) 464 (464) 1600 (1600) 3904 (3904) 4928 (4032) 896 (0) 
Markov basis 36 (36) (0) 4 (4) 4 (4) 4 (4) (0) (0) 


lxlxl 


grid: 














Degree 






2 


3 


4 5 


6 


Graver basis 
Circuits 
Markov basis 


42 (42) 
42 (42) 
33 (33) 


224 (224) 
224 (224) 
8(8) 


1032 (1032) 1728 (1152) 
1032 (1032) 1728 (1152) 
12 (12) (0) 


672 (96) 
576 (0) 
0(0) 
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Note that the Markov basis in both examples consists of squarefree bino- 
mials only and that the number of binomials with squares of degree 7 and 
8 for the 2x3 grid, respectively of degree 5 and 6 for the lxlxl grid is 
the same. We wonder if this also holds for higher dimensions. 

We end this section by illustrating the conclusion of Theorem 4.5 for our 
running example. 

Example 4.12 (2x1 grid). The set C( 2,1 ) of circuits of the 8 x 14-matrix 
_4( 2 > 1 ) ) i n Example 3.9, consists of 29 elements. Each circuit C corresponds to 
a unique (up to sign) vector Vc in {0, +1, — l} 14 n kernel^^ 2 ' 1 ^), and hence 
to a difference of squarefree monomials in 

R[P] = R[Rqo, Rqi, R\o, Rn,Lio,Ln,L2o, L21, Uoo, Uiq, U20, Uoi,Dn,D2i\. 

We now list the 29 binomials in the Graver basis C^ 2 ' 1 ) of 14(2,1) • First, 
there are the eight quadrics which generate i^ 2 ' 1 '. These correspond to the 
eight circuits of type C\. Next, there are five circuits of type Ci- These are 
displayed below quadratic binomial and ELS cl picture: 



(1) -Roo-kio — -Roi-^ii 



(2) R10L20 — -R11L21 



(3) D 1 iUio-D 01 U 00 



(4) D21U20 - D 01 U 00 

(5) D2 1 U2 -D 11 U W A 1 1 . 

The remaining 16 circuits are given below with the corresponding bino- 
mials in the Graver basis. 



(1) -Roo-Rio^20 — RoiRuUqo 

t* T 







(2) L10L20U00 — L11L21U20 



COMMUTING BIRTH-AND-DEATH PROCESSES 

:: — *r* 

(3) R00L21U20 — R01L20U00 



23 



(4) R10L11U20 — RiiLiqUqq 



(5) R10D11U20 — RuDqiUoq 



(6) L21D11U20 — L20DQ1II00 



(7) R00D21U20 — RoiDnUoo 



(8) L11D21U20 — LiqDiiUqo 



(9) RqqRiqDqi — R01R11D21 



(10) L10L20D21 — L\\Li\D\ 



01 



(11) -Roo-^21 Ah — R01L20D21 



(12) RiqLuDqi — R11L10D21 



(13) R11D21U10 — RiqDqiUqq 



(14) L 20 D 2 iU w - L2iD 01 U 00 



(15) R01D21U20 - RooDoiU w 
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(16) L 10 D 2 iU 2 o -LnAu^io i 1 • 

Similar lists of circuits can be computed, at least in principle, for larger 
and higher-dimensional grids. However, their combinatorial structure will 
be quite complicated. To appreciate this, we invite the reader to draw the 
picture for the circuit of the 4 x 4-grid displayed in (5.6) below. 

5. Primary decomposition and boundary components. The set of com- 
muting birth-and-death processes is the set of nonnegative real solutions of 
the quadratic equations (3.1). In the previous two sections we have focused 
on the set of strictly positive solutions, and we have argued that its clo- 
sure is a toric variety V(IX) with interesting combinatorial properties. In 
this section we apply binomial primary decomposition [3, 6] to classify those 
nonnegative solutions of (3.1) that are not in the closure of the positive 
solutions. 

Notation 5.1. Let /("-i >•••>"•»>) d eno te the ideal in the polynomial ring 
K[P] that is generated by the quadrics in (3.1). Here K is allowed to be an 
arbitrary field. The ideal /( re ii— .^m) j s a binomial ideal in the sense of [6]. It 
represents all commuting birth-and-death processes. 

To keep things simple, we first concentrate on the two-dimensional case. 
We use the notation of Section 2 for anmxn grid. Then /( m ' n ) is a binomial 
ideal in the polynomial ring L, D, U] over an arbitrary field K. The set 
of indeterminates equals 

{Rij : < i < m, < j < n} U {L^- : < i < m, < j < n} 

U {Dij : < i < m, < j < n} U {Uy : < i < m, < j < n}. 

The ideal /( m,n ) is minimally generated by the following set of 4mn quadratic 
binomials 

{UijRij+x — RijUi+ij, Rij+iDi + ij + i — Dij + \Rij, 
Di+i t j+iLi + ij — Li + ij + iDij + i, 
U+i,jUij - U i+ ijL i+1 j +1 :0 <i<m,0<j<n}. 

As a warm-up, let us discuss the simplest case, namely the four binomials 
on a single square. 



COMMUTING BIRTH-AND-DEATH PROCESSES 



25 



Example 5.2 [(m 
pairs of matrices 



(5.1) 



n ■ 



1)]. The variety V^ 1 ' 1 *) C K 8 consists of all 




and 



I ° 

An 



V o 



Ao 










Ai 






Ao 




that commute. The possibilities for this to happen are revealed by the pri- 
mary decomposition, 

jC 1 ' 1 ) = (U 00 R 01 — Rq Uio, i?oiAi 

— Ai-Rooj Ai Ao — Ai Ai> AoAo — AoAi) 
= A n (A)0) Aoj Ai) Ai) n (A)0i An> Ao; Ai)- 

Here 7.4 is the toric ideal of the matrix A = A^ 1 ' 1 ^. It is generated by the six 
2 x 2-minors of the matrix, 



(5.2) 



Ao Ao Ai Ali 
An Ao Ao An 



We see that y^ 1 ' 1 )) is the union of three irreducible components. The main 
component V(A) has dimension 5 while the two boundary components have 
dimension 4. The two matrices (5.1) commute if and only if either one of 
them is the zero matrix, or the matrix (5.2) has rank < 1. 

An ideal in a polynomial ring is radical if and only if it is a finite inter- 
section of prime ideals. We say that a pure toric ideal is a prime ideal which 
is generated by indeterminates and differences of monomials. For example, 



(xy 



is pure toric, but (xy + z ) and (x 



are not. 



Conjecture 5.3. The ideal /( m,n ) is radical, all its associated primes 
are pure toric ideals and the prime decomposition of i~( m ' n ) in K.[R,L,D,U] 
is independent of the coefficient field K. 



Example 5.2 establishes this conjecture for m = n = 1. What follows is 
devoted to establishing partial results that support Conjecture 5.3 and to 
explore the combinatorics of the irreducible components of our binomial 
variety V~(/( m > n )) of commuting birth-and-death processes. 

Example 5.4 [(m < 3, n = 1)]. The radical ideal i^ 2 ' 1 ) is the intersection 
of 11 pure toric ideals. These 11 primes are easily found in Singular or 
Macaulay2. They come in six symmetry classes: 
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1. The main component has codimension 6 and degree 16. It corresponds 
to the rank one condition for two matrices as in (5.2), one for each of the 
two squares in the 2 x 1-grid, 

I A = (2x 2-minors of ( ^ 00 ^ 00 \ 11 ^ n 
\ Uio Lio L)qi 



and of 



-Rio U±o L 



21 



D 



21 



Rii U20 L20 Du 



2. There are two components of codimension 7 and degree 4, each isomorphic 

times a four-dimensional coordinate subspace. One of these 
two primes is the pure toric ideal generated by {L21, L20, Rn, Rio} and 
the six 2 x 2-minors of the matrix (5.2). 

3. The monomial component representing horizontal transitions has codi- 
mension 6, 



(Uoo, Uio, U20, A)ij Dix, D21) 

4. The monomial component representing vertical transitions has codimen- 
sion 8, 



{Roo, Rqi,Rio, Rii,Liq, Lii,L2o,L 



21/ 



5. There is another pair of monomial primes of codimension 8. One of these 
two ideals is 



{Roo,Roi,LiQ,Ln, Uio, U20, Dn,D2i) 
6. The last class consists of four monomial primes of codimension 7, such as 



(Rio 1-^21, Uoo ,U\o, An > Ai > Aa ) 

A similar computation shows that J^ 3 ' 1 ) is also a radical ideal. It is the 
intersection of 40 pure toric ideals. The list is analogous to that above. 
Here, the main component ^4 is generated by the 18 minors of the three 
2 x 4-matrices coming from the three squares in the grid. 

Current general-purpose implementations of primary decomposition are 
not able to perform the corresponding computation for the 2x2 grid. To 
crack this case, we applied the methods of [3, 6] in an interactive fashion in 
order to verify Conjecture 5.3 and find the prime decomposition. 
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Example 5.5 [(m = n = 2)]. The ideal j( 2,2 ) is radical and it is the irre- 
dundant intersection of 199 pure toric ideals. The toric ideal I a representing 
the main component is minimally generated by 26 binomials. Besides the 24 
familiar quadrics which come from the 2 x 4-matrices on the four squares 
of the grid, here we find the following two additional quartic Markov basis 
elements: 

R00L22U20D02 — R02L20U00D22 and 

(5.3) 

R10L12U21D01 — R12L10II01D11. 

We shall next derive one theoretical result related to Conjecture 5.3. Let I 
be a pure binomial ideal in a polynomial ring K[xi, X2, • • • , x r ], that is, / is an 
ideal generated by monomial differences x™ 1 x\^ • • • x™ r — x\ x x^ • • ■ x v r r . The 
lattice Ci associated with I is the sublattice of HI spanned by the vectors 
(u\ — vi, U2 — V2, ■ ■ ■ , u r — v r ) in any generating set of I. For any positive 
integer i < r, the elimination ideal / n K[x\, . . . is also pure binomial, 
and we have 

(5-4) £inK[xu...,Xi] =£1 n (Zei H h Zej). 

A result of Gilmer [8] states that the radical \fl of a pure binomial ideal I is 
also a pure binomial ideal, and, using the methods developed in [6], Section 
3, it is not hard to see that Ci = C rj. 

We say that a pure binomial ideal I is unimodular if the corresponding 
lattice Ci is a unimodular sublattice of 27 . By this we mean that the quotient 
group If /Ci is free abelian and the same holds for every elimination ideal 
of /. Equivalently, / is unimodular if the quotient group 7L{e s : s G a}/(Ci n 
Z{e s : s G <r}) is free abelian for every subset o of {1,2, ... ,r}. 

Proposition 5.6. If I is a unimodular pure binomial ideal in a poly- 
nomial ring K[xi, . . . , x r ], then every associated prime of I is a pure toric 
ideal. 

Proof. We abbreviate the polynomial ring by S = K[x±, . . . , x r ]. If I is 
toric then we are done. Otherwise, we follow the approach in the proof of 
[6], Theorem 6.1, and pick a variable Xi such that both (I:xi) and I + (xi) 
strictly contain /. Note that the lattice Ci + ^ x .^ equals £7 fl Z{ei, . . . , e%-\, 
ej+i, . . . , e r } and hence is unimodular. The exact sequence, 

(5.5) — > S/(I : Xi ) — > S/I — > 5/(7 + (a*)) — > 0, 

shows that each associated prime of I is an associated prime of {I:xj) or of 
I + ( x i) ■ Both (I : X{) and / + (xi) are unimodular pure binomial ideals, and 
they satisfy the desired conclusion by Noetherian induction. Hence so does 
I. □ 
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We have the following result which generalizes the main conclusion of 
Example 5.4. 

Corollary 5.7. Every associated prime of the binomial ideal I^ 71 ' 1 ' is 
a pure toric ideal. 

Proof. In Theorem 4.5 we have shown that the lattice kernel^^™' 1 )) 
associated with our binomial ideal j( n ' 1 ) is unimodular. By Proposition 5.6, 
every associated prime of Jf"' 1 ) is pure toric. □ 

An overly optimistic conjecture suggested by Examples 5.4 and 5.5 would 
be that a minimal Markov basis for the main component I4 of I^ m ' n ' consists 
of the 6mn quadrics derived from the matrices (5.2) on all 1 x 1-subgrids and 
the 2(m — l)(n — 1) quartic binomials (5.3) derived from all 2 x 2-subgrids 
of the m x ra-grid. Unfortunately this is far from true. For instance, for 
m = n = 3 the Markov basis of I a consists of 314 binomials, including the 
54 quadrics and 8 quartics, but there are also 16 quintics, 36 sextics and 200 
binomials of degree eight, such as 

(5.6) U00II32L20L30L13L23R21R02 — ^30^02-^11-^21^22^32^00^23- 

It would be desirable to identify which circuits make up the Markov basis 
for arbitrary m and n. 

We next examine the situation for the smallest example of a three-dimensional 
grid. 

Example 5.8 (1 x 1 x 1-grid). The ideal - m g enera t e d by 24 

quadratic binomials (four for each face of the 3-cube) in 24 unknowns (two 
for each edge of the 3-cube) . We write 

_ (LioofTooo — f/ioo-^iioi £101^001 — ^loi-^ni) 
-^iioAho — -Diio-Liooj-kmAni — DmLioi, 
-^100-^000 — -Fioo-^ioi , iiio-Pbio — -Fiio-km, 
£ioi-E>ooi — -Bioi-Lioc-^iii-Bon — Bniiuo, 
-Rooo^ioo — t^ooo-RoiO) -Rooi^ioi — t^ooi-Roii) 
-Roio-Dno — Ajio-RooO) -Roii-Dm — A)ii-Rooi 5 

-^000-^100 — -F0OO-R0OI1 -RoiO-^110 — FqiqRqii, 

-R001-B101 — -B001-R000, i?oii -Bin — -E>on-Roio> 
-Bioi^ioo — £^101-6111,-6001 £^000 — f^ooi-Bon, 
-BonAno — DqiiBqoi, BmDiio — Dm-Bioi, 
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5)00^001 — ^000-^010)^100^101 — ^100^110) 

-^bio-Con — Aho-Foooj ^lio-^m — -DuoFioo)- 
The main component is a unimodular toric ideal of codimension 14 and de- 
gree 300. Its Markov basis consists of 53 binomials (33 quadrics, 8 cubics and 
12 quartics), and its Graver basis consists of 3698 binomials (42 quadrics, 
224 cubics, 1032 quartics, 1728 quintics and 672 sextics). 

We found that the binomial ideal i^ 1 ' 1 ' 1 ) is the intersection of 131 prime 
ideals, so it is radical. Besides the main component, there are 91 monomial 
primes and 39 other binomial primes: 

# cd deg representative associated prime 

2 12 1 (i?ooi, -Bm, Z?oiO) -Dm, Foio,iqoO) 

Lioo, Lm, Rqqi, Rqiq, Uooi, t/ioo) 

2415 1 (i?oii)-Bioi, -Bm, -Doiii -Duo, -^oooi-^ioo; -^110; 

Lioi, Luq, Lm, Rqqq, Rqqi, Rqh,Uooo} 
9 16 1 (i?ooi 5 -Bon, -Bioi, -Bm, -FooOj-^biOi-^ioo^no, 
-^ioo, -^101 , 6110,6111, -Rooo, Rooi , -Roio, Ron) 
48 17 1 (5101,-6111,1)010,-0011, -Dm^oio, -^100, -^no, 

6100, -^101, -^no, Lm, -Rooo, -Rooi, -Roio, 6ooo, 6001) 
8 18 1 (i?on, -Bioi, -Bm, A)io, Don, -Duo, -foio, -Fioo, -^no, 
Bioo, -^101, 6m, -Rooo, -^ooi, -Ron, ^ooo, 6001,6100) 

3 14 16 (5no,5ioo,5oio,5ooo, 6m, 6101, 6011,6001 

and 12 of the binomial generators of J^' 1 ' 1 )) 
12 15 4 (5ooo,5oio,-^ioo,-^iio,5ho,5ioo,5oio,5ooo, 

5m, 5ioi, 5on, 5ooi and six binomials) 

2416 9 (Lm,Lioi,5ooo,5oio,5oio,5ooo,-^iio,-^ioo, 

5no, 5iqo, 5m, 5ioi and six binomials). 



We end by restating the main point of Conjecture 5.3 for grids in the 
m-dimensional lattice. 

Conjecture 5.9. The binomial ideal l' ni, -' n " 1 ' is radical. 

Note added in proof: On May 4, 2009, Thomas Kahle announced that he 
had disproved Conjectures 5.3 and 5.9 using his new software for primary 
decomposition of binomial ideals. Kahle's result states that the ideal j( 2,3 ) 
is not radical. More precisely, this ideal is the intersection of 2638 primary 
ideals of which 11 are not prime. A witness is given by the binomial 

/ = D01R03R10L12U21L22D23 — C/oi5o3Llo5i352i-L23523- 

Using Macaulay 2 or Singular, we can check that f 2 e 1^ but / j( 2 - 3 ). 
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